SmartMTD: A Graph-Based Approach for Effective Multi-Truth Discovery

نویسندگان

  • Xiu Susie Fang
  • Quan Z. Sheng
  • Xianzhi Wang
  • Anne H. H. Ngu
چکیده

The Big Data era features a huge amount of data that are contributed by numerous sources and used bymany critical data-driven applications. Due to the varying reliability of sources, it is common to see conflicts among the multi-source data, making it difficult to determine which data sources to trust. Recently, truth discovery has emerged as a means of addressing this challenging issue by determining data veracity jointly with estimating the reliability of data sources. A fundamental issue with current truth discovery methods is that they generally assume only one true value for each object, while in reality, objects may have multiple true values. In this paper, we propose a graph-based approach, called SmartMTD, to unravel the truth discovery problem beyond the single-truth assumption, or the multi-truth discovery problem. SmartMTD models and quantifies two types of source relations to estimate source reliability precisely and to detect malicious agreement among sources for effective multi-truth discovery. In particular, two graphs are constructed based on the modeled source relations. They are further used to derive the two aspects of source reliability (i.e., positive precision and negative precision) via random walk computation. Empirical studies on two large real-world datasets demonstrate the effectiveness of our approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Value Veracity Estimation for Multi-Truth Objects via a Graph-Based Approach

A fundamental issue with current truth discovery methods is that they generally assume only one true value for each object, while in reality objects may have multiple true values. We propose a graph-based approach, called SmartMTD, to relax this assumption in truth discovery. SmartMTD models and quantifies two types of source relations to estimate source reliability precisely and to detect mali...

متن کامل

Truth Discovery from Conflicting Multi-Valued Objects

Truth discovery is a fundamental research topic, which aims at identifying the true value(s) of objects of interest given the conflicting multi-sourced data. Although considerable research efforts have been conducted on this topic, we can still point out two significant issues unsolved: i) single-valued assumption, i.e., current methods assume only one true value for each object, while in reali...

متن کامل

Exploring Relevance as Truth Criterion on the Web and Classifying Claims in Belief Levels

The Web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the Web. Moreover, different websites often provide conflicting information on a subject. Several truth discovery methods have been proposed for various scenarios, and they have been successfully applied in diverse application domains. In this paper...

متن کامل

From Appearance to Essence: Comparing Truth Discovery Methods without Using Ground Truth

Truth discovery has been widely studied in recent years as a fundamental means for resolving the con icts in multi-source data. Although many truth discovery methods have been proposed based on di erent considerations and intuitions, investigations show that no single method consistently outperforms the others. To select the right truth discovery method for a speci c application scenario, it be...

متن کامل

A multi agent method for cell formation with uncertain situation, based on information theory

This paper assumes the cell formation problem as a distributed decision network. It proposes an approach based on application and extension of information theory concepts, in order to analyze informational complexity in an agent- based system, due to interdependence between agents. Based on this approach, new quantitative concepts and definitions are proposed in order to measure the amount of t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1708.02018  شماره 

صفحات  -

تاریخ انتشار 2017